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* ABSTRACT 

Many commercial processors now offer the possibility of extending their instruction set for a specific 
application---t hat is, to introduce customised functional units. There is a need to develop algorithms 
that decide automatically, from high-level application code, which operations are to be carried out in 
the customised extensions. A few algorithms exist but are severely limited in the type of operation 
clusters they can choose and hence reduce significantly the effectiveness of specialisation. In this 
paper we introduce a more general algorithm which selects maximal-speedup convex subgraphs of 
the application dataflow graph under fundamental microarchitectural constraints, and which improves 
significantly on the state of the art. 
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